Answering queries over incomplete data stream histories

نویسندگان

  • Alasdair J. G. Gray
  • Werner Nutt
  • M. Howard Williams
چکیده

Streams of data often originate from many distributed sources. A distributed stream processing system publishes such streams of data and enables queries over the streams. This allows users to retrieve and relate data from the distributed streams without needing to know where they are located. Stream data is important not only for its current values but also for past values produced. In order to support this, the history of the stream must be archived and stream processing systems must support history queries. However, one problem which then arises is that data streams published by distributed sources may have missing data values, e.g. due to a network failure. Since the stream has missed some values, the stored history of the stream contains gaps. This paper considers the effects of missing information on the answers generated for history queries. The assumptions about the data streams are analysed so that techniques for detecting missing values can be developed. A model for representing the incomplete information has been developed together with an approach to answering history queries where relevant data is missing. Case studies have been drawn from the context of the r-gma system, which integrates distributed data streams to provide information and monitoring data about resources on a Grid. However, the model and techniques considered are general and could be applied wherever there is a need to query the history of distributed data streams.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Answering Arbitrary Conjunctive Queries over Incomplete Data Stream Histories

Streams of data often originate from many distributed sources. A user wanting to query the streams should not need to know from where each stream originates but should be provided with a global view of the streams. R-GMA is a system that integrates distributed data streams to provide a global view of all the streams for users to query. R-GMA has been developed as a grid information and monitori...

متن کامل

ارائه روشی پویا جهت پاسخ به پرس‌وجوهای پیوسته تجمّعی اقتضایی

Data Streams are infinite, fast, time-stamp data elements which are received explosively. Generally, these elements need to be processed in an online, real-time way. So, algorithms to process data streams and answer queries on these streams are mostly one-pass. The execution of such algorithms has some challenges such as memory limitation, scheduling, and accuracy of answers. They will be more ...

متن کامل

Knowledge and Information Systems REGULAR PAPER

In some business applications such as trading management in financial institutions, it is required to accurately answer ad hoc aggregate queries over data streams. Materializing and incrementally maintaining a full data cube or even its compression or approximation over a data stream is often computationally prohibitive. On the other hand, although previous studies proposed approximate methods ...

متن کامل

I-SQE: A Query Engine for Answering Range Queries over Incomplete Spatial Databases

Spatial database systems built on top of distributed and heterogeneous spatial information sources such as conventional spatial databases underlying Geographical Information Systems (GIS), spatial data files and spatial information acquired or inferred from the Web, suffer from data integration and topological consistency problems. These issues make the globally-integrated spatial database inco...

متن کامل

Towards Temporal Fuzzy Query Answering on Stream-based Data

For reasoning over streams of data ontology-based data access is a common approach. The method for answering conjunctive queries (CQs) over DL-Lite ontologies in this setting is by rewritings of the query and evaluation of the resulting query by a data base engine. For streambased applications the classical expressivity of DL-Lite lacks means to handle fuzzy and temporal information. In this pa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJWIS

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2007